A method for simultaneous variable selection and outlier identification in linear regression

Similar resources

A method for simultaneous variable selection and outlier identification in linear regression*

We suggest a method for simultaneous variable selection and outlier identification based on the computation of posterior model probabilities. This avoids the problem that the model you select depends upon the order in which variable selection and outlier identification are carried out. Our method can find multiple outliers and appears to be successful in identifying masked outliers. We also add...

A diagnostic method for simultaneous feature selection and outlier identification in linear regression

A diagnostic method along the lines of forward search is proposed to simultaneously study the effect of individual observations and features on the inferences made in linear regression. The method operates by appending dummy variables to the data matrix and performing backward selection on the augmented matrix. It outputs sequences of feature–outlier combinations which can be evaluated by plots...
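The augmentation step mentioned in this abstract can be illustrated with a minimal sketch. This is not the authors' procedure: the backward-selection stage is omitted, and the function names `augment_with_dummies` and `ols`, the simulated data, and the choice of candidate rows are all assumptions made for illustration. It only shows the standard mean-shift fact that appending an indicator column for an observation gives the same feature coefficients as deleting that observation, and that the dummy's coefficient absorbs the observation's shift.

```python
import numpy as np

def augment_with_dummies(X, rows):
    """Append one indicator column per candidate row (hypothetical helper)."""
    n = X.shape[0]
    D = np.zeros((n, len(rows)))
    for j, i in enumerate(rows):
        D[i, j] = 1.0
    return np.hstack([X, D])

def ols(X, y):
    """Ordinary least-squares coefficients via numpy.linalg.lstsq."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef

# Simulated data (assumption): intercept plus two features, one planted outlier.
rng = np.random.default_rng(0)
n, p = 30, 3
X = np.column_stack([np.ones(n), rng.normal(size=(n, p - 1))])
beta = np.array([1.0, 2.0, -1.0])
y = X @ beta + rng.normal(scale=0.1, size=n)
y[5] += 10.0  # planted outlier at row 5

candidates = [5, 12]  # rows given a dummy variable (arbitrary choice)
Xa = augment_with_dummies(X, candidates)
coef_aug = ols(Xa, y)

# Fit with the candidate rows deleted, for comparison.
keep = np.setdiff1d(np.arange(n), candidates)
coef_del = ols(X[keep], y[keep])

print("feature coefficients (augmented):", coef_aug[:p])
print("feature coefficients (deleted):  ", coef_del)
print("dummy coefficients:", coef_aug[p:])
```

A large dummy coefficient flags the corresponding row as a candidate outlier; a selection procedure would then decide which feature and dummy columns to retain.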

A Novel Resampling Method for Variable Selection in Robust Regression

Variable selection in regression analysis is of vital importance for data analysts and researchers who want to fit a parsimonious regression model. With the inundation of large numbers of predictor variables and large data sets requiring analysis and empirical modeling, contamination becomes a common problem. Accordingly, robust regression estimators are designed to fit contaminated data sets easily. In the...

Variable selection in linear regression through adaptive penalty selection

Model selection procedures often use a fixed penalty, such as Mallows’ Cp, to avoid choosing a model which fits a particular data set extremely well. These procedures are often devised to give an unbiased risk estimate when a particular chosen model is used to predict future responses. As a correction for not including the variability induced in model selection, generalized degrees of freedom i...
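As a point of reference for the fixed penalty named in this abstract, here is a minimal sketch of Mallows' Cp in its usual form, Cp = SSE_k / sigma2_hat - n + 2k, with sigma2_hat estimated from the full model. It covers only the fixed-penalty baseline, not the adaptive procedure the paper proposes. The function names `sse` and `mallows_cp` and the simulated data are assumptions made for illustration.

```python
import numpy as np

def sse(X, y):
    """Residual sum of squares of the least-squares fit of y on X."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ coef
    return float(resid @ resid)

def mallows_cp(X_full, y, cols):
    """Cp for the submodel using columns `cols` of X_full.

    sigma2_hat is the unbiased error-variance estimate from the full model.
    """
    n, p_full = X_full.shape
    sigma2_hat = sse(X_full, y) / (n - p_full)
    k = len(cols)
    return sse(X_full[:, cols], y) / sigma2_hat - n + 2 * k

# Simulated data (assumption): only the intercept and first feature matter.
rng = np.random.default_rng(1)
n = 50
X = np.column_stack([np.ones(n), rng.normal(size=(n, 3))])
y = X[:, :2] @ np.array([1.0, 2.0]) + rng.normal(size=n)

cp_full = mallows_cp(X, y, [0, 1, 2, 3])  # equals the number of columns by construction
cp_true = mallows_cp(X, y, [0, 1])
cp_null = mallows_cp(X, y, [0])

print("Cp full model:     ", cp_full)
print("Cp true submodel:  ", cp_true)
print("Cp intercept only: ", cp_null)
```

Submodels with small Cp are preferred; the intercept-only model is penalized heavily here because it drops a feature that carries real signal.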

A statistical test for outlier identification in data envelopment analysis

In the use of peer group data to assess individual, typical or best practice performance, the effective detection of outliers is critical for achieving useful results. In these “deterministic” frontier models, statistical theory is now mostly available. This paper deals with the statistical pared sample method and its capability of detecting outliers in data envelopment analysis. In the prese...

Journal

Journal title: Computational Statistics & Data Analysis

Year: 1996

ISSN: 0167-9473

DOI: 10.1016/0167-9473(95)00053-4